2,769 research outputs found

    A Comparison of Blocking Methods for Record Linkage

    Full text link
    Record linkage seeks to merge databases and to remove duplicates when unique identifiers are not available. Most approaches use blocking techniques to reduce the computational complexity associated with record linkage. We review traditional blocking techniques, which typically partition the records according to a set of field attributes, and consider two variants of a method known as locality sensitive hashing, sometimes referred to as "private blocking." We compare these approaches in terms of their recall, reduction ratio, and computational complexity. We evaluate these methods using different synthetic datafiles and conclude with a discussion of privacy-related issues.Comment: 22 pages, 2 tables, 7 figure

    Evaluation of Monilinia fructicola on apricot fruits

    Get PDF
    Monilinia fructicola has been a quarantine pathogen in Europe until 2014; however, the disease risk remains large for Prunus species, because of the continuing spreading around Europe. In order to assess the impact of this fungus on apricot fruits, differences in variety susceptibility need to be evaluated

    ERBlox: Combining Matching Dependencies with Machine Learning for Entity Resolution

    Full text link
    Entity resolution (ER), an important and common data cleaning problem, is about detecting data duplicate representations for the same external entities, and merging them into single representations. Relatively recently, declarative rules called matching dependencies (MDs) have been proposed for specifying similarity conditions under which attribute values in database records are merged. In this work we show the process and the benefits of integrating three components of ER: (a) Classifiers for duplicate/non-duplicate record pairs built using machine learning (ML) techniques, (b) MDs for supporting both the blocking phase of ML and the merge itself; and (c) The use of the declarative language LogiQL -an extended form of Datalog supported by the LogicBlox platform- for data processing, and the specification and enforcement of MDs.Comment: To appear in Proc. SUM, 201

    A procedure for hydrogel contact lens base curve verification

    Get PDF
    A procedure for hydrogel contact lens base curve verificatio

    Production of diosgenin by hairy root cultures of Trigonella foenum-graecum L

    Get PDF
    Hairy root cultures ofTrigonella foenum-graecum L. were established withAgrobacterium rhizogenes strain A4. The hairy roots produce diosgenin, an important spirostanol for the semi-synthesis of steroid hormones. Fourteen different liquid media were investigated. The fastest growth was obtained in McCown's woody plant (WP) medium supplemented with 3% sucrose; the highest diosgenin content was observed in half-strength WP medium with 1% sucrose (0.040% dry weight), which represents almost twice the amount detected in the 8-month-old non-transformed roots (0.024%). A time-course study in WP liquid media supplemented with 3% sucrose was undertaken. In these conditions, 17 μg diosgenin/g fresh weight were produced. The influence of cholesterol, medium pH and chitosan on diosgenin production was tested. The addition of 40 mg/l chitosan elevated the diosgenin content to three times that found in non-elicited hairy root

    Different methods of evaluation of Monilinia laxa on apricot flowers and branches

    Get PDF
    - Organic apricot production is currently not profitable. - The main obstacle to sustainable profitability is brown rot caused by the fungus Monilinia laxa (Aderh. & Ruhl). - In the current apricot germplasm no source of total resistance has been shown, but some varieties are expressing interesting levels of tolerance. - A good evaluation of the M. laxa symptoms is essential for a precise diagnosis of the infection and to appreciate differences between tolerant and susceptible varieties and genotypes

    Misfolding and aggregation of nascent proteins: a novel mode of toxic cadmium action in vivo.

    Get PDF
    Cadmium is a highly poisonous metal and a human carcinogen, but the molecular mechanisms underlying its cellular toxicity are not fully understood. Recent findings in yeast cells indicate that cadmium exerts its deleterious effects by inducing widespread misfolding and aggregation of nascent proteins. Here, we discuss this novel mode of toxic heavy metal action and propose a mechanism by which molecular chaperones may reduce the damaging effects of heavy metal ions on protein structures
    corecore